Molecular Descriptors Property Prediction Using Transformer-Based Approach

نویسندگان

چکیده

In this study, we introduce semi-supervised machine learning models designed to predict molecular properties. Our model employs a two-stage approach, involving pre-training and fine-tuning. Particularly, our leverages substantial amount of labeled unlabeled data consisting SMILES strings, text representation system for molecules. During the stage, capitalizes on Masked Language Model, which is widely used in natural language processing, chemical space representations. fine-tuning trained smaller dataset tackle specific downstream tasks, such as classification or regression. Preliminary results indicate that demonstrates comparable performance state-of-the-art chosen tasks from MoleculeNet. Additionally, reduce computational overhead, propose new approach taking advantage 3D compound structures calculating attention score end-to-end transformer anti-malaria drug candidates. The show using proposed score, able have with pre-trained models.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Systems Biological Approach of Molecular Descriptors Connectivity: Optimal Descriptors for Oral Bioavailability Prediction

BACKGROUND Poor oral bioavailability is an important parameter accounting for the failure of the drug candidates. Approximately, 50% of developing drugs fail because of unfavorable oral bioavailability. In silico prediction of oral bioavailability (%F) based on physiochemical properties are highly needed. Although many computational models have been developed to predict oral bioavailability, th...

متن کامل

Transformer protection using MLE approach

This paper proposed a new wavelet based method to identify inrush currents and to distinguish it from power system faults. The proposed algorithm extracts fault and inrush generated transient signals using DWT. Transient current signals at both sides of a transformer are firstly captured. The wavelet transform is a relatively new and powerful tool in the analysis of power transformer transient ...

متن کامل

In-silico prediction of Cellular Responses to Polymeric Biomaterials from Their Molecular Descriptors

In this work quantitative structure activity relationship (QSAR) methodology was applied for modeling and prediction of cellular response to polymers that have been designed for tissue engineering. After calculation and screening of molecular descriptors, linear and nonlinear models were developed by using multiple linear regressions (MLR) and artificial neural network (ANN) methods. The root m...

متن کامل

Prediction and Dissection of Protein-RNA Interactions by Molecular Descriptors.

Protein-RNA interactions play crucial roles in numerous biological processes. However, detecting the interactions and binding sites between protein and RNA by traditional experiments is still time consuming and labor costing. Thus, it is of importance to develop bioinformatics methods for predicting protein-RNA interactions and binding sites. Accurate prediction of protein-RNA interactions and ...

متن کامل

Prediction of pesticides chromatographic lipophilicity from the computational molecular descriptors.

Quantitative structure-property relationship models were developed for the prediction of pesticides and some PAH compounds lipophilicity based on a wide set of computational molecular descriptors and a set of experimental chromatographic data. The chromatographic lipophilicity of pesticides has been evaluated by high-performance liquid chromatography (HPLC) using different chemically bonded (C1...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Molecular Sciences

سال: 2023

ISSN: ['1661-6596', '1422-0067']

DOI: https://doi.org/10.3390/ijms241511948